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HOW TO USE THE SEDS PRODUCED BY SYNTHESIS MODELS (INSIDE AND OUTSIDE THE VO)? 
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ABSTRACT 



In this contribution we investigate how to describe the 
results and usage of evolutionary synthesis models. In 
particular, we look for an explicit and quantitative de- 
scription of the parameter space of synthesis models and 
the evaluation of their associated uncertainties and disper- 
sion. First, we need to understand what synthesis models 
actually compute: we show that a synthetic stellar pop- 
ulation with fixed physical parameters (age, metallicity, 
star formation history, initial mass function and size of 
the system) can only be described in terms of probability 
distributions (i.e. there is an intrinsic dispersion in any 
model). Second, we need to identify and characterize the 
coverage in the parameter space of the models (i.e. the 
combinations of input parameters that yield meaningful 
models) and the different sources of systematic errors. 
Third, we need a way to describe quantitatively the in- 
trinsic dispersion, the systematic error and the parameter 
space coverage of the models. 

Up to now, the parameter space coverage and uncertain- 
ties have been described qualitatively in the models' ref- 
erence papers, with potential misinterpretations by mod- 
els' users. We show how Virtual Observatory develop- 
ments enable a correct use of synthesis models to obtain 
accurate (and not simply precise) results. 

Key words: Synthesis models; Virtual Observatory. 



1. INTRODUCTION 



The aim of the Virtual Observatory (VO) is to provide an 
environment to produce more accurate science. Not only 
does the VO provide an opportunity for technical devel- 
opments, it also offers unique possibilities for scientific 
research: The VO framework requires an explicit descrip- 
tion of the data that it provides with no implicit assump- 
tions about the data themselves. This requirement is fun- 
damental since interoperability is performed by machine 
to machine communications without supervision by hu- 
man beings. Such a description is performed by the data 
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Figure 1. IVOA spectrum data model. 



model working group in the International Virtual Obser- 
vatory Alliance (IVOA). 

The rationale of the data model working group is to pro- 
vide an abstract description of concepts and their interre- 
lationships, used to fix both the names and meanings of 
concepts in the VO context and also their internal struc- 
ture and cross-connections (IVOA Spectral Data Model 
1.01, 2007J3, that is to describe all the possible universes 
of classes of data. Note that such description and inter- 
relationships are fundamental in scientific research also, 
with the only difference that, in the last case, researchers 
are focused on particular types of objects. 

Figure[T]shows the spectrum data model generated by the 
data model working group. This data model aims to de- 
scribe all the universe of tables in the VO that contain a 
spectrum, hence the data model must not only be valid 
for observed data, but also for theoretical spectra. 

This data model (the generic description of a spectrum) 
allows to formulate new questions for researchers that ei- 
ther produce or use theoretical spectra: Does this data 
model describe theoretical spectra? Can all the "boxes" 
in the data model be filled? And, if this is the case, what is 
the accuracy of a theoretical model? Do theoretical mod- 
els have systematic errors? And, do they have statistical 
errors? 



et /Document s / WD /DM/ Spect rumDM-20 07 05 15 . html 



2 




i a i.e 4.4 iz 4 a.a a.B 



Figure 2. Draft illustration of the concepts of the IVOA 
characterization data model for a typical atmosphere li- 
brary like those used in populations synthesis showing 
the resolution of the library (precision in the VO data 
model) and its coverage (support in the VO data model). 
Isochrones are also drawn for an intuitive comparison of 
the support of both sets of data. 

In addition to the spectrum data model, this working 
group has also proposed a data model for astrophysical 
dataset characterization (IVOA Data Model for Astro- 
nomical DataSet Characterization 1.11, 2007fl This data 
model defines the information required to describe the 
parameter space of observed or simulated astronomical 
datasets. Fig. |2]is a draft illustration of the description of 
an atmosphere library like those used in population syn- 
thesis. 

Again, a correct characterization of models is fundamen- 
tal for their application to observations. All theoretical 
models have a limited range of validity, given by the com- 
putational algorithms, the resolution of the input data and 
other factors. Usually, these limitations are described 
qualitatively in the reference papers, but unfortunately, 
it is not so common for these limitations to be intro- 
duced in the code that computes the models (Fortunately, 
there are som e significant exceptions, like CLOUDY by 
iFerland et al.lll998l) . 

We stress again that an answer of the previous issues is 
fundamental in scientific research, since the basic way 
to obtain the physical parameters of an observed system 
is to compare observational data with theoretical models. 
Only if these questions are solved in a quantitative way, 
will it be possible to evaluate the accuracy (not the preci- 
sion) of our knowledge of the observed system. 

In the case of stellar populations, theoretical mod- 
els are provided by stel la r pop u lation synthesis 
(iFanelli. O'ConneU & Thuanl Il987li iTinslev & Guml 
119761) . The general case of population synthesis 
aims to decompose the observed light of the system 
under study in different stellar populations compo- 
nents, that is the contribution of classes of stars, each 
class with different effectiv e temp erature and gravity 
(IFanelli. O'ConneU & Thuanl Il987l) . In this case, no 
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further information about the observed system can be 
obtained unless additional a priori information is consid- 
ered. In the case of e volutionary population synthesis 
dTinslev & Gunnl[l976l) . the contribution of the different 
stellar classes is defined by the evolutionary status of the 
system, hence the knowledge of the contribution of the 
different stellar classes can be transformed into knowl- 
edge about the age, metallicity, stellar birth-rate (i.e. star 
formation history, initial mass function and the amount 
of formed stars) of the system. Although evolutionary 
synthesis models are powerful, it is necessary to take into 
account that: 

• The transformation between the contribution of stel- 
lar classes and the associated physical parameters 
requires some a priori hypotheses: a set of atmo- 
sphere models, evolutionary tracks and defined star 
formation properties (i.e. stellar birth rate). 

• There are different evolutionary conditions that re- 
sult in the same integrated spectra, so the physical 
parameters obtained from the analysis of observa- 
tions are not necessarily unique. 

These a priori hypotheses, together with the computa- 
tional algorithms and approximations used by different 
synthesis codes, define the range of use (i.e. character- 
ize) of the results. They are also the principal source of 
systematic errors in model computations. On the other 
hand, the degeneracy of the possible solutions produces 
an intrinsic precision loss in any fit of observational data 
and theoretical models: the best-fit solutions are not nec- 
essarily the correct ones and the distribution of the good- 
ness of fit is also needed for a correct interpretation of the 
observed data. In order to analyze these issues in detail, 
we need to understand what a synthesis model is. 



2. WHAT IS A SYNTHESIS MODEL? 



The first step needed to evaluate systematic errors, disper- 
sion and the range of application in population synthesis 
models is to understand how synthesis models work. The 
main result of synthesis models is the luminosity due to 
an ensemble of n individual stars, each with a given lu- 
minosity, £i. The total luminosity of the ensemble is: 



L tot = J"li=n- y yi i = n<l> . (1) 

i=l i=l 

where < I > is the estimation of the mean luminosity of 
an individual star. 

However, in a system where individual stars are not re- 
solved, we cannot perform such a sum, since we do not 
know the individual £i values. In this case, we can obtain 
the total luminosity as the composition of different stellar 
populations (or different stellar types). If we assume that 
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Figure 3. Mean spectrum and 90% confidence interval 
region of clusters with 10 5 M Q transformed into stars, 
with a given metallicity and a given age. 
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Figure 4. Mean spectrum of clusters with 10 5 Mq trans- 
formed into stars, with a given metallicity and a wide 
range of ages. 



a given stellar type i can be represented by a luminos- 
ity li and we have some rules that give us the proportion 
Wi between different stellar types for different physical 
parameters (wi = Wi[t, Z, IMF, SFH]), then 



Lgf°(n) = n Wik = n <l >, (2) 

i=l 

Wcla B3 

^Wi = l. (3) 

i=l 

where < £ > is the mean (or average) luminosity of an 
average star that represents the ensemble. Note that this 
mean luminosity is not necessarily the luminosity of any 
real star, but it is a description of an underlying prob- 
ability distribution of the possibles luminosities of real 
stars in the ensemble. As a trivial example, remember the 
half-alive half-dead Schrodinger's cat solution in quan- 
tum mechanics. 

Although Eq. Q]and Eq. [2]look similar, they do not pro- 
vide the same quantity: The first case is the actual inte- 
grated luminosity of a given cluster and the second case 
is the mean integrated luminosity of all the possible clus- 
ters with n stars and in the same evolutionary conditions. 
Note also that the problem in population synthesis is not 
the classical statistical one of estimating n < £ >, but the 
inverse one: obtaining the possible theoretical n < £ > 
values compatible with the observed integrated luminos- 
ity L tot . So, it is necessary to use the complete distribu- 
tion of theoretical integrated luminosities rather than its 
mean value L*JjJr°(n). This distribution provides a metric 
for goodness-of-fit to stellar population studies. 

As an illustration of the effects of this intrinsic disper- 
sion, we show in Fig. [3] the 90% confidence interval of 
the integrated luminosity of a stellar cluster with 10 5 M@ 
transformed into stars for a given stellar birth rate, metal- 
licity and age. It represents the integrated spectrum of all 



possible clusters with this number of stars, and evolution- 
ary conditions. Fig. |4]show the mean spectrum of a clus- 
ter with similar conditions but different ages. The 90% 
confidence interval in Fig. [3] with a single age, covers 
all the region defined by the mean values of the spectra at 
different ages. Any fit of a real spectrum of such a clus- 
ter would produce a very precise result for the age, but 
the accuracy of this fit cannot be known if the underly- 
ing probability distributions of integrated luminosities at 
different ages are not taken into account. 

We refer to ICervifio & Luridianal d2006l) for a more ex- 
tensive analysis of th e probabilistic formulation o f pop- 
ulation synthesis and lCervino & Luridianal d2007bl) for a 
more complete description about the metrics for good- 
ness of fit. 



3. SYSTEMATIC ERRORS IN SYNTHESIS MOD- 
ELS 



Once the origin of the intrinsic dispersion of population 
synthesis models has been described, we are going to dis- 
cuss the sources of systematic errors and how to take 
advantage of the IVOA Characterization data model to 
define the region of application of synthesis models in 
its para meter space. We refer to ICervifio & Luridianal 
(2007a) for a more complete review on the systematic er- 
rors. 

In the following, we will only consider the systematic er- 
rors related to stellar population synthesis computations 
itself, assuming that the systematic errors of the inputs 
are known. The main source of systematic errors are: 



1. Atmosphere libraries If. systematic errors in atmo- 
sphere libraries mainly depend on how realistic our 
assumption about the classification of stars in the 
different stellar types are (see Garcia- Vargas' et al. 
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contribution in these proceedings). Such an evalua- 
tion must be implemented by synthesis models mak- 
ers. 

In addition to the systematic errors, the range of ap- 
plication of synthesis models depends in the cover- 
age on the parameter space (logg, T e g, metallicity, 
...), that is the characterization of the library in this 
parameter space. 

2. Tracks/Isochrones and their combination with the 
star formation history nf. In the case of synthe- 
sis models, the shape of isochrones and the den- 
sity of stars along the isochrone must fit observa- 
tional data. Small variations in these densities can 
produce big differences in the models. Hence, it is 
crucial to know the uncertainties in the evolutionary 
tracks/isochrones used to evaluate the systematic er- 
ror in synthesis models. Currently, there are differ- 
ent groups working on the evaluation of systematic 
errors in evolutionary tracks, like Degl'Innocenti et 
al. and Bressan et al. 

Additionally, since synthesis models describe the 
whole population, they also need to take into ac- 
count evolutionary phases that are included in stan- 
dard evolutionary tracks, like pre-main sequence 
evolution, final evolutionary stages like SN or cool- 
ing white dwarfs, etc. In general, synthesis models 
makers combine tracks of different groups to obtain 
a good coverage in evolutionary phases, but in doing 
so, they include a potential source of error due to dif- 
ferent physics used in track computations. The eval- 
uation of the associated systematic errors and the 
coverage of the models results due to these effects 
must be implemented by synthesis models makers. 

3. Synthesis models algorithms: the final issue is the 
algorithms used in the model. The main problem 
is how to combine the different coverage and sam- 
pling (support and precision respectively, following 
the Characterization data model) of the atmosphere 
libraries and isochrones in the log g — log T c s plane 
(see Fig. |2). In some cases, it is necessary to extrap- 
olate the behavior of atmosphere libraries to regions 
not covered by the libraries. In addition, there are 
also differences in how the correspondence of the 
points in the isochrones and in the atmosphere li- 
braries is done, ranging from how the libraries are 
interpolated to how the closest model to the given 
isochrone point is chosen. 



Although the characterization of synthesis models is a 
task under development, it is currently possible to de- 
fine its range of application by comparing the coverage of 
isochrones and atmosphere libraries and using the IVOA 
characterization data model for this task. It will not be 
the final solution of the problem of coverage, but it will 
enormously help towards the use of synthesis models in 
their correct range of application. 



4. CONCLUSIONS 



Synthesis models, as any other theoretical model, are af- 
fected by systematic errors. Although the evaluation of 
such errors is a difficult task that only can be performed 
by synthesis models makers, the current IVOA descrip- 
tion of spectrum data model allows to add these errors to 
the output. 

In the case of statistical errors, the spectral data model 
does not apply directly to theoretical results since the goal 
of statistics is inferring the underlying distribution from 
which the data have been drawn, which is assumed to be 
known in the computation of theoretical models. How- 
ever, the StatError box in the current spectral data model 
can be used to describe the intrinsic dispersion of theo- 
retical models. The dispersion of synthesis models, al- 
though not recognized as such, is computed by several 
codes under the label of surface brightness fluctuations, 
so this piece of information is already available. 

Finally, a proper characterization of isochrones and at- 
mosphere libraries coverage allows to obtain a realistic 
characterization of synthesis models, and its range of ap- 
plication. 
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